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1 Introduction 



Quantum information theory [^J is not merely a theoretical basis for physics 
of information and computation. It is also a source of challenging math- 
ematical problems, often having elementary formulation but still resisting 
solution. It appears that surprisingly little is known about what may be 
called the combinatorial geometry of tensor products of Hilbert spaces, even 
in finite dimensions. One group of open problems concerns the additivity 
properties of various quantities characterizing quantum channels, notably 
the capacity for classical information, and the "maximal output purity" , de- 
fined below. All known results, including extensive numerical work in the 
IBM group [|TJ, the Quantum Information group in the Technical Univer- 
sity of Braunschweig, and elsewhere, are consistent with the conjecture that 
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these quantities are indeed additive (resp. multiplicative) with respect to ten- 
sor products of channels. A proof of this conjecture would have important 
consequences in quantum information theory: in particular, according to this 
conjecture, the classical capacity or the maximal purity of outputs cannot be 
increased by using entangled inputs of the channel. 

In this paper we state the additivity/multiplicativity problems, give some 
relations between them, and prove some new partial results, which also sup- 
port the conjecture. 



2 Statement of the problem 

Let us give precise formulation of the additivity problem for the classical 
capacity (see f2|, ||). Let B(TC) be the * -algebra of all operators in a finite 
dimensional unitary space 7i. We denote the set of states, i.e. positive unit 
trace operators in B(H) by S(TC), the set of all m-dimensional projections 
by Vmi'H) and the set of all projections by V(7i). A quantum channel $ is a 
completely positive trace preserving linear map of B(7i) (we are in the finite 
dimensional case and we use the Schrodinger picture). These are the maps 
admitting the Kraus decomposition (see e. g. J7|, 0) 

s(p) = £4kM;, (l) 

k 

where A k are operators satisfying J2k A* k Ak = I. 

Let H(p) = —Tip log p denote the von Neumann entropy of the state p 
and define 

c($) = m«[irEft$W) - Y,Pi H (Hpi))l 

Pi, Pi 

% I 

where the maximum is taken over all finite probability distributions {pi} on 
S(TC), ascribing probabilities pi to (arbitrary) states pi. The quantity C($) 
appears as the "one-step classical capacity" of the quantum channel $ or 
the capacity with unentangled input states (we refer to |J for a detailed 
information-theoretic discussion and the proof of the corresponding coding 
theorem). A thorough discussion of the properties of C($) is given in ||. 



2 



The additivity problem can be formulated as follows: let $!,...,<&„ be 
channels in the algebras B(7i 1 ), . . . , B(TC n ) and let $1 <g> . . . <E> $„ be their 
tensor product in B(7i 1 ® . . . ® 7i n ) . Is it true that 

n 

C($ 1 ®...®$„)=E^(^) ? ( 2 ) 
1=1 

This obviously holds for reversible unitary channels; in || the additivity 
was established for the so called classical-quantum and quantum-classical 
channels, which map from or into an Abelian subalgebra of B(TC). 

Another closely related problem is the additivity of a quantity, which can 
be read as the "maximal output purity" of a channel. In fact, there are 
several quantities of this kind, depending on the way we measure "purity". 
If we just take the von Neumann entropy as a measure of purity, we arrive 
at the question [[§], || whether or not 

n 

pes(Hi®...®Hn) ■ 1 peS(H i ) 

For a particular class of channels this property implies (^) (see the Lemma 
in Section [| below). 

We will also consider this problem for other measures of purity, based on 
the noncommutative £ p -norms 

\\A\\ p = (Tr\A\P) l P , 

defined for p > 1, and A G 13(H), with the operator norm ||v4|| corresponding 
naturally to the case p = oo. For an arbitrary quantum channel $ let us 
introduce the following notations for the "highest purity" of outputs of a 
channel 

u H ($) = min#($(p)), (4) 
i/ p ($) = max||$(p)|| p , (5) 
z/_ 00 ($) = min||$(p)- 1 ||- 1 , (6) 

where the extrema are taken with respect to all input density matrices p . 
By convexity of the norms, the extrema in the above definitions are attained 
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on pure states (in the first (resp. last) case the operator convexity of the 
function x i— > xlogx (resp. x i-» is also relevant). 

Then the additivity/multiplicativity inequalities 

i/„($i®$ 2 ) > i/ p ($i)i/ p ($ 2 ) (7) 
Z/ ff ($!®$ 2 ) < V H ($i) +u H ($i) (8) 

are clear from inserting product density operators into the defining varia- 
tional expressions. The standing conjecture is that equality always holds in 
these inequalities, i.e. that choosing entangled input states is never helpful 
for getting purer output states. 

Before proceeding to show some new partial results on this problem, it is 
helpful to establish the relation between and ^ p ($) for p close to one. 

Of course, some relationship is expected, as the von Neumann entropy H(p) 
can be computed in terms of the derivative of \\p\\ p at p = 1 + . Here we find 
that if the equality holds in ( [?[) for p arbitrarily close to 1, then it holds also 
in (|). 

Proof. We shall use the fact that for every < x < 1 

1 — x p log x 



p — 1 log e 

if p I 1. Thus kz^iEE j s a monotonely increasing family of continuous 
functions of the variable p which varies in the compact set S(7i), converg- 
ing pointwise to the continuous function H(<&(p)). By Dini's Theorem, the 
convergence is uniform, and 

• tti^i w r 1 - max p Tr$(p) p 

mm H(§(p)) = hm 

p Pli p-1 

Therefore, if the equality holds in (^) for p close to 1, 

• tt,,* ^x/xx l-max p Tr(($ 1 ®$ 2 )(p)) p 

mm ® $ 2 ) (p)) = hm 

p p|i p — 1 

1 - max„Tr ($i(p)) p max p Tr ($ 2 (p)) p . . 
Pli p-1 P p 



□ 
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3 Tensoring with an ideal channel 



The first natural step is to establish the multiplicativity property when one 
factor is the identity channel. 

Lemma. For * = p, H, — oo 

<g> Id) = . (9) 

Since z/ p (Id) = 1, and i/#(Id) = 0, this is indeed an instance of the additiv- 
ity/multiplicativity conjecture. 

Proof. We shall restrict to the case *=p, l<p<oo. The argument 
in the case * = — oo is similar and the case * = H follows by the argument 
given above. 

Let us denote by 0.1,0.2 the Hilbert spaces of the first and the second 
system, respectively. Let 0i2 be a unit vector in 0\ ®Oi, and write pu = 
|0i2) (^12 1 and p\ = Tr2pi2 for the partial state in 0\. If $ is the channel in 
Oi, we denote p' 12 = ($ <8> Id)(pi 2 ). Let us dilate the channel $ to a unitary 
evolution C/13 with the environment O3, initially in a pure state p 3 = \4>3}((f>3\. 
The the final state of the environment is 

p , 3 ^Tr 1 U 13 (p 1 ^\cj )3 )( ( j )3 \)U* 3 = ^(p 1 ). 

Since the state of the composite system Oi ®02 ®0% remains pure after the 
unitary evolution, its partial states p' 12 ,p' 3 are isometric [0. Therefore 

\\{<$>®ld)(p X2 )\\ p = \\p' 12 \\ p = ||^|| p = ||*(pi)|| p . 

Now the map p\ — > ^f(pi) is affine and the norm is convex, therefore the 
maximum of the quantity above is attained on pure pi, whence pu = p\ ®P2, 
and the statement follows. □ 

We shall specifically need this Lemma in the case p = 00. It is instructive 
to see an alternative direct proof in this case. 

Proof. In what follows we take p = \(f))((f)\. We compute the operator norm 
of the Hermitian operator $(p) as ||$(p)|| = sup^(ip,^(p)ip), and take $ to 
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be given in the Kraus decomposition ([I] ). Then 

z/ oc ($) = sup]T(^, A k (j))((j),Al^) , 

where the supremum is over all unit vectors in the appropriate spaces. The 
expression under the supremum can be read as the £ 2 -norm of a vector with 
components (0, A%ijj) . We write this norm also as "the largest scalar product 
with a unit vector" Xi i- e -> 

= ( sup = (sup\\J2x^At^\\) (10) 

\<t>,^,x k / VV'.x k ) 

= sup\\J2xkA* k \\ 2 = sup\\J2xkA k \\ 2 , (11) 

X k X k 

where all suprema are over unit vectors. Obviously, the Kraus operators for 
$ <g> Id are A k <g> /, so 

^(^Id) = sup\\J2xk(A k ®I)\\ 2 = sup || \J2XkA k ) ®I\\ 2 = !/«,($) . 
* k * \ k J 

□ 

We will also need the analogous result for a quantity in which the two 
vectors 0, ip in the above proof are fixed to be the same: for any channel 
let 

i4($)=sup(V,$(|V)(V'IM, (12) 

where the supremum is again over all unit vectors. Note that this expression 
only makes sense, if the channel does not change the type of system, i.e., 
input and output algebra are the same. Then 

<g> Id) = z/t($) . (13) 

Proof: Again we use Kraus decomposition ([I]). For ip we use the Schmidt 
decomposition tp = e ^ ® where the e M and are orthonormal 

systems. Then the expression to maximized on the left-hand side becomes 

E (c^ c « c /3) 1/2 (e M ® e^, (A fc ® I)e„ <g> e^)(e Q <g> e' Q , (A* g> J) e/3 <g> e{,) 
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i.c^c v c a cp) l/2 {e^ A k e u ) (e Q , A* k e p ) 



fiua/3k 



J2c^c a (e^,A k e^)(e a ,A* k e a ) = ^ tr(p 1 A fc )tr(p 1 A^) , 




where p\ = c v\ e n) ( e ^l ^ s the reduced density matrix belonging to ip. Since 
the function p 1 \— > |tr(p!A fc )| 2 is convex, this expression attains its maximum 
with respect to ip when p 1 is pure, i.e., when tp is a product. □ 



One testing ground for the multiplicativity/additivity conjecture are chan- 
nels close to the identity. For such channels the purity parameters can be 
evaluated in lowest order in the deviation from the identity. Doing this for 
each subchannel and for their tensor product, one can explicitly check the 
conjecture. As the following result shows, this test supports the conjecture. 

Consider a channel with weak noise, i.e. choose some channel $ on B(7i), 
and set 



For small e this is a weak noise channel, which has the property that for any 
pure input the output will be nearly pure. 

Theorem. The multiplicativity hypothesis for the quantities with 
1 < p < oo and the additivity hypothesis for the quantity i>h{Q) hold true 
approximately in the leading order in e. 

Proof. In order to estimate these quantities for the weak noise channels, 
we need to estimate entropy, and the p-norms near a pure state. Let p 
be a density operator on a d-dimensional Hilbert space, and suppose that 
||p|| = 1 — e + o(e). Then the leading order of the other norms is determined 
completely by e: 



4 Weak Noise 



$W = (l _ e )ld + e$ . 



(14) 



||p||p = 1 — e + o(e) for p > 1 
H(p) = -eloge + o(eloge) , 



(15) 
(16) 
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where as usual o(e) stands for terms going to zero faster than e as e — > 0. In 
this case we can say more: in first line we have < remainder < C e p , for 
e < 1/2, where C is a constant depending only on the dimension. Similarly, 
the estimates in the second line are independent of the details of p. Hence in 
leading order all the variational expressions are equivalent: each one amounts 
to maximizing e. 

Let us go back to the weak noise channel (14). To get high fidelity we 
need to maximize the leading term, so we can take rj = £ in the following 
computation: 

Voo ($M) = sup<£,$ (e) (to>fal)0 

{(l-m,v)\ 2 + e{tM\v){v\)t)) 
= l-e + esup<£,$(|£><£l)0 +o(e) 

= l_ e + 6H ($) + (e). (17) 

Note that in all these estimates the remainder estimates can be done uni- 
formly for all channels, depending only on dimension. 



sup 



A tensor product of weak noise channels ([14] ) is again of the same form: 

$( £ ) = $f . . . $J) 

= (1 - e)"Id + e(l - e)"" 1 <g> Id 2 ... n + • • • + Mi..,.! ® $ n ) + o(e) 
= (1 - ne)Id + ne 5$ + o(e) , 

where 5$ is the average of the n channels Idi...fc_i <S> &k ® Idfc+i... n - Hence, in 
order to compute the leading order of ) by formula (0) we have to 

determine i^(5$). We have 



< ~5^($ fc ®Id) 

-| m 
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where we have used in turn: insertion of product states into the supremum 
defining 1^(5$), convexity of i\ as a supremum of affine functionals, and fi- 
nally the restricted additivity result (|T3|). Hence equality holds, which means 
that in the leading order in e all the variational expressions for the purity 
quantities with * = p,H,\) are attained at product states. □ 



5 Depolarizing Channels 



A channel is called bistochastic if = /, where I is the unit operator in 
B(7i). An important example is the depolarizing channel H 

$(p) = (1 - p)p + ^(Trp)/, p G B(H), < p < 1, 

where d = dim7i. A channel is called binary if d = 2. 

Lemma. Let $ be binary bistochastic channel, then 

C($) = log2- min H{®{p)). (18) 
pes(n) 

If $j are binary bistochastic channels, then (^) implies ( |2|). 

Proof. The < part of (|l8j) is obvious from the fact that for any channel 
C($) < logdimH- min H($(p)), (19) 

P<=S(H) 

so we need to prove only > part. Since the entropy is convex, the minimum 
is achieved at the set of extreme points of S{7i) which is Vi(H). Let p be 
the minimum point, then taking equiprobably po = p, pi = I — p, we obtain 

C($) > H{U>{I)) - \[H{S{p)) + H($(I - p))]. 

Since the channel is bistochastic, this is equal to H(\l) — |[/J($(p)) + H(I — 
$(p))], and since it is binary, this is equal to the right-hand side of (|l8l). 

To prove the second statement, it is sufficient to prove the < part of ( 
|2|), since > part follows from the definitions. But this follows from ( |l9l) and 

(USD. □ 
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In the paper || the relation(f2|) was proven for the two binary depolarizing 

channels $1, $2- The proof heavily uses Schmidt decomposition and as such 

does not generalizes to the case n > 2. The main difficulty is evaluating the 

entropy of the product channel. However, it appears to be possible to check 

the additivity in the limiting cases of "weak" and "strong" depolarization in 

the leading order. Let us consider a collection of depolarizing channels 

in the Hilbert spaces Hi, with parameters p iy d h i — 1, 2, . . . , n, and denote 

$ = <g>" =1 <I>i, H = &i =1 Hi, d = nr=i di. In the following we shall use symbols 

Ii and / = ®" =1 ij for the identity operators in Hi and H respectively. Let 

€l be the tensor product (f>i <g) . . . ® n , where (fii(p) = j-Tr(p)Ii, i G L C 

{1,2, . . . , n}, and 4>i(p) = p otherwise, p G SiTLj). Then is a conditional 

expectation onto the subalgebra A4l, generated by operators of the form 

Ax ® . . . ® A n , where Ai = Ii for i 6 I, and is normalized partial trace with 

respect to its commutant. It has the property min dimP = TTi 1 -! d^ L ^\ 

p e v(M L ) 1-1 * 

where Ol(i) — 1 if i G L and 0l{%) = otherwise. Here we denoted by 
V(A4l) the set of all orthogonal projections in M.l- Notice that the inclusion 
M Ll C Ml 2 holds if L 2 C L x . So e Ll e L2 = e LlVL2 . 

We shall use the expansion 

n 

^ = EIlft i(4) (i-^) 1 ^ (i W (20) 

L i=l 

Weak (strong) depolarization corresponds to the case where all pi (respec- 
tively, 1 — are small parameters, which we assume to be of the same 
order. 

Proposition. The relation holds in the cases of weak and strong 
depolarization approximately in the leading order. 

Proof. In the case of weak depolarization the statement follows from the 
Theorem in Section |j. 

In the case of strong depolarization we have to retain all the terms up to 
the second order in qi = 1 — pi. Then the leading terms are 



$(P) ~ d 



-i 



1 + Y1 Qi(diPi - I) + - diPi - djPj + didjPij) 
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where we denoted d = EKLi di, Pi is the partial state of P in the i— th Hilbert 
space, multiplied by the unit operator in the tensor product of the remaining 
Hilbert spaces, and similarly P^. Denoting the first (second) sum in the 
squared brackets A\ (respectively A 2 ) one easily sees that both are traceless 
operators. Moreover, up to the second order, 



$(P) log$(P) ~ d- 1 

and 



[1 + Ai + A 2 ) logcT 1 +U 1 + A 2 + ^ 



TrA 2 

pr($(p))~iogrf-^i. 

But 

n 

TrAl = dJ2qUdiTrpl-l), 
i=i 

where pi is the partial state of P in the i— th Hilbert space, which is maxi- 
mized if and only if pi is one-dimensional projection, i.e. P = p± <8> • • • <8> p n -^ 

Partial answers to the multiplicativity hypothesis are given by the follow- 
ing Theorem. In fact, multiplicativity of foo(^) for binary bistochastic maps 
follows from a more general result in 0. 



Theorem. 



n n 



i=l i=l 



^oo($) = n ^oo(^) = n (1 - 

1=1 i=l v 



w 

n n 

{in) z/_ 00 ($) = n ^-oo(^) = n f 

i=l i=l 1 



Pi(rfj-l) 



Proof, (i) It follows from the relation min dimP = IE-i ^ L(i) that 

Tr(6 Ll (P)e i2 (P)) = Tr(P6 LlVi2 (P)) = Tr(e LlVL2 (P) 2 ) < f[ d ^(^^ 

%=i 

for arbitrary P G Vi(7i) and equality holds only for factorizable projections. 
Hence 

n 

Tr(($(P)) 2 ) = TriiYsUP^il-Pi^MP)) 2 ) 

L i=l 
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< E f\p . Ll ® +6L2 ®(l _p i ) 2 - e £i( i )-^2( i )rf _max(eZjl( * ) ' 6,z ' 2(i)) 
Li,L 2 i=l 



n n / J -r -i 

n e p! i+9 ii-# 9i4 ^r" (9iW = n V^-^+j 

1=101, 02=0,1 1=1 V ^ a ' 



fii) Let us estimate 



\mp)\\ <EIIPi (i-ft) 1 -^ !!^)!! 

L i=l 

, ? n(ir P -^-n(i-^). 

which proves the second statement. 
(Hi) We have 

iiwr^ziw-^ii* ( 2i ) 



fc=0 

Let us calculate 

max ||$(I-P)||< max \\&(Q)\\. 

We have 

n 

\mQ)\\<Upi\\^,..,n}(Q)\\ 

1=1 

+ E flpi L( ^-Pi) 1 - eLW \MQ)\\< 

L^{l,2,...,n}i=l 

{i-d-')f[p i+ E f[p! u Hi-Pi) 1 ^ L ® = i-f[j- (22) 

i=l L^{l,2,...,n} i=l i=l ° l 

Here we have used the equality X) J] — Pi) 1-6 ^** = 1. Substituting 

L i=l 

(p2[) into ( |2~1|) , we get the last statement. □ 
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